CDS

Accession Number TCMCG024C51764
gbkey CDS
Protein Id XP_022033842.1
Location join(139932030..139932719,139932824..139932882,139933659..139933728,139934046..139934171,139934538..139934666,139935820..139936248)
Gene LOC110935798
GeneID 110935798
Organism Helianthus annuus

Protein

Length 500aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022178150.2
Definition nucleolar protein 12 [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category A
Description RNA-binding protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko03009        [VIEW IN KEGG]
KEGG_ko ko:K14837        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005634        [VIEW IN EMBL-EBI]
GO:0005730        [VIEW IN EMBL-EBI]
GO:0031974        [VIEW IN EMBL-EBI]
GO:0031981        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043228        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0043232        [VIEW IN EMBL-EBI]
GO:0043233        [VIEW IN EMBL-EBI]
GO:0044422        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044428        [VIEW IN EMBL-EBI]
GO:0044446        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0070013        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGGGAAGAAGAAATCAAAAGAACCCAAACCTGAAATCCCCAATTCATCACCTCAACCTTCATCAGATAACATTTTCAAATCCCTTTTCGGCAGCACCCATCAAGAACCCAATGACCCATCCTCTGTTTCAATCTTCTCAGATTCCAACCCTTTCCGAACCAAACCCACCAAAGATTCCCAGAAAGATCCACAAATCCTTCAACCAAACACCATTATCTCCCAAAATAACGACACCCAGATCCCCAATTCTCCCAATAACCCATTGAACAAGAAGAAAAACGAGAAATCACCCAAAAAAGTTGATCAAGACACCAAGAAATCAAATATCCCAAATGGGGTTGTTTCAGAAAGTCCAAAGAGCTCAAAAAGTTTGAGTCTTGATGAGGTGGATGAGAATAAGAAGAAGAAAAAGAAGAAGAAGGCTGAGGTTGAAGCAGAGTATGAGGAAAGAAAGTATGGAGGGGTGGATTTGGAATTAGAGAAAGATAAGGTGGCAAAAGGGAAAATTGGGGAGAAGAGGAAGGGGGTGGATGTTTTGAAAGAGGGATTTGATGATGAAGAGAAGCTTTCAAGAACTGTGTTTGTTGGGAATCTGCCTTTGAAGGTGAAGAAGAAGGCATTGTTGAAAGAGTTTAGTCAGTTTGGAGAGATAGAATCCGTTCGAATTCGATCTATTCCTTTATTAGATGATAAGACTCCAAGAAAGGGTGCTGTGATCAAGAAGAAAATCAATGATGCTGTTGACAGGGTTAATGCATACATTGTTTTCAAGACCGAAGATTCCGCCCAAGCTTCTTTATCACATAACATGGCAGTTGTGGGTGGAAATCATATTCATGTAGACAGAGCTTGTCCGCCACGTAAGAAACTTAAGGGAGAAAATGCTCCTCTCTATGACAGCAAAAGGACTGTTTTTATTGGTAACCTCCCATTCGACGTCAAGGATGAAGAACTTTATCAGCTGTTTACTGGTTTTAACAATCTGAAAGACTGCATAGAGGCGATTCGAGTGGTGAGAGATCCTGGTACAAGCATGGGAAAGGGCATTGCTTATGTCTTGTTTTCAACACGGGAAGCTGCAAATACGGTTGTTAGAAAACATAAACTGAAGATCCGAGACAGGGAGCTGAGGTTATCTCATGCCTTGAAAGCAAGCGCATCAACACCATCAAAAAACAAGGAATCATCATCCACAAACAGTTACAGTTCTGCTAAGAAGGCGGCTGTGGGCGGAAACGCCTCATACCAGGGAATACGGGCCACCAAATCCGGTGGCCAGAAGAAGTTTGCGACCAGAATAACTAAACCTGGCAGGAGTGAATCAAGAAGTGAAACTGTGGTGAAGCGAAAAGTACGTTCGGAAAAGAGGCCAGCGGTTGCTGCTAGAAAGGCGGCAGCAGTTGCATCTAAAACTGGCGGTGATAGCGGCGCCGGAGGTGTGAAACGCAAGGCTAAACCAGAGAGTAATAACCGGAATAAGAAACCTAGGAAATTCAGATAG
Protein:  
MGKKKSKEPKPEIPNSSPQPSSDNIFKSLFGSTHQEPNDPSSVSIFSDSNPFRTKPTKDSQKDPQILQPNTIISQNNDTQIPNSPNNPLNKKKNEKSPKKVDQDTKKSNIPNGVVSESPKSSKSLSLDEVDENKKKKKKKKAEVEAEYEERKYGGVDLELEKDKVAKGKIGEKRKGVDVLKEGFDDEEKLSRTVFVGNLPLKVKKKALLKEFSQFGEIESVRIRSIPLLDDKTPRKGAVIKKKINDAVDRVNAYIVFKTEDSAQASLSHNMAVVGGNHIHVDRACPPRKKLKGENAPLYDSKRTVFIGNLPFDVKDEELYQLFTGFNNLKDCIEAIRVVRDPGTSMGKGIAYVLFSTREAANTVVRKHKLKIRDRELRLSHALKASASTPSKNKESSSTNSYSSAKKAAVGGNASYQGIRATKSGGQKKFATRITKPGRSESRSETVVKRKVRSEKRPAVAARKAAAVASKTGGDSGAGGVKRKAKPESNNRNKKPRKFR